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Much of economic theory is built on observations of aggregate, rather than individual, behavior. 
Here, we present novel findings on human shopping patterns at the resolution of a single purchase. 
Our results suggest that much of our seemingly elective activity is actually driven by simple routines. 
While the interleaving of shopping events creates randomness at the small scale, on the whole 
consumer behavior is largely predictable. We also examine income-dependent differences in how 
people shop, and find that wealthy individuals are more likely to bundle shopping trips. These 
results validate previous work on mobility from cell phone data, while describing the unpredictability 
of behavior at higher resolution. 



Human economic behavior is curbed by human geogra- 
phy: constraints on mobility determine where we can go 
and what we can buy. At the same time, the elect ivity of 
shopping itself drives our movement. Results from mo- 
bile phone data have shown that human mobility follows 
a truncated Levy flight pQ and that the unpredictabil- 
ity of individual trajectories is bounded [2]. But does 
this predictability hold from one dataset to the next? To 
what extent do existing routines drive where one shops, 
and to what extent do explicit decisions shape our shop- 
ping patterns? 

We consider a random sample of 10, 000 accounts from 
a major financial institution, over a three-month win- 
dow in 2010. The anonymized dataset includes all check, 
credit card, and debit card transactions, as well as cash 
withdrawals and automatic wire payments. Merchants 
are identified by name and by Merchant Category Code 
(MCC) [7], and we estimate the income of individuals 
from payments made into the account. 

The sequence of shops frequented by a single individ- 
ual can be described as a network where nodes are mer- 
chants and edge weights represent transition probabilities 
(fig. [TJ. Shopping hubs represent locations of frequent 
purchase, and are often grocery stores, gas stations or 
convenience stores. We are also able to plot population- 
level transition probabilities between merchants of differ- 
ent types (fig. [2]). 

Our findings suggest that at longer time scales, shop- 
ping behavior is constrained by some of the same features 
that have been seen to govern mobility generally. Shop- 
pers return to stores with remarkable regularity: a Zipf 's 
law (s = 4 ± 0.031) also describes the probability that a 
customer will visit a store at rank N (where N = 3 is his 
third most-frequented store, for example), independent 
of the total number of stores visited in a three-month 
period. These results support those of Gonzalez et al. 

Similarly, the random (log 2 N{ , where Ni = number of 
locations visited by shopper i) and temporal-uncorrelated 
(Y.iPiU) l °g2PiU), where pi(j) is the probability that 
user i visited location j) entropies of individuals comprise 
a distribution akin to that seen from cell phone patterns, 
albeit with higher mean entropies (fig. [3|. Moreover, 
these entropies are largely stable over four years for the 
same sample of individuals. 
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FIG. 1: Network linking stores frequented by a single indi- 
vidual over a three-months in 2008. Edge weight represents 
the transition probability from one location to another. A 
grocery store serves as the shoppers main hub, and a natural 
grocery store as his secondary hub. Actual store names have 
been replaced with merchant type. 



Yet this predictability falls off at higher temporal res- 
olutions, suggesting that individual choices propel daily 
activity. When we incorporate the sequence of stores fre- 
quented to estimate the true entropy using the average 
Kolmogorov complexity, we find the entropy largely un- 
changed (fig. [3]): that is, the diversity of an individual's 
shopping behavior is driven by the number of stores he 
visits and the frequency at which he visits, not by the 
order in which he shops. This is a departure from the 
findings of Song et al. in cell phone calls, where the or- 
der of sites visited explains a large portion of mobility 
patterns. 

This discrepancy can be explained in part by the effect 
of interleaving store visits in time: I might go to the 
supermarket and then the post office, but I could just 
as well reverse this order. With the large-scale mobility 
patterns inferred from cell phones, I am unable to change 
many of my routines: I drive to the office after I drop off 
the kids at school. Indeed, when we simulate the effect of 




FIG. 2: Network of transitions between stores of different mer- 
chant category codes of 2, 000 individuals in 2008. Individuals 
belong to the top quintile of "most predictable" (lowest en- 
tropy, with total store visits held constant) in the sample. 
Edge weight and color represent the transition probabilities 
from a merchant in one retail category to one in another. For 
instance, 5411 represents grocery stores; a full list of MCC 
codes is available. 




FIG. 3: Distribution of true, random and uncorrelated en- 
tropies for set of 2,000 individuals over three-month window 
in 2010. Relative to Song et al results from mobile phone, 
adding sequence information does not lead to markedly lower 
distribution of true entropies. 



novel orderings by randomizing shopping sequence within 
a day, we find little change in total entropy. However, 
we are able to approach the levels of true entropy seen 
in the mobile phone data by sorting the order of shops 
visited over weekly intervals and thus imposing artificial 
regularity on shopping sequence (fig. [4|. 

We wish to understand, as well, whether there exist 
demographic correlates to these consumer behaviors. In 
contrast to literature on cell phones, which asserts that 
predictability is independent of features such as income, 



FIG. 4: 10, 000 run Monte Carlo simulation on a 2, 000 person 
subsample of effect of (a) randomizing and (b) sorting a set 
of stores visited over a week-long period. Averaging multiple 
runs of shuffled sequence information does not change the 
entropy significantly, while sorting lowers the entropy. 



we observe marked differences in the ways that wealthy 
and poor individuals shop. We segment our sample and 
consider poor those with an annual inflow of less than 
$16,000, and wealthy as those with inflow greater than 
$80, 000. Because we are only able to measure inflows 
into accounts associated with this bank, income here 
serves as a reasonable lower bound on earnings: other 
research suggests that this bound is asymmetric insofar 
as wealthy individuals are more likely to automatically 
direct income to multiple accounts [8]. 

Wealthy people, on average, sho p at a greater variety 
of stores than do their peers (fig. |5(b)[ ), a phenomenon 
that might be attributed to differential access to commer- 
cial opportunities [3 , to a taste for variety capacitated 
by greater discretionary income [4 , or to some combina- 
tion of factors. Because of the variety of t heir sh opping, 
the rich have higher random entropies (fig. |5(a)[ ). An in- 
dividual's temporally uncorrelated entropy, meanwhile, 
depends on the diversity of his store portfolio as well as 
on his distribution of purchases across stores. 

While the wealthy exhibit larger uncorrelated entropies 
than do the poor (fig. |5(a)| ), the difference between ran- 
dom and uncorrelated e ntropi es, on a by-person basis, is 
greater for the rich (fig. 5(c)). That is, a steeper distri- 
bution of store visits dampens the uncorrelated entropy 
of wealthy shoppers, while their tendency toward com- 
mercial variety increases entropy. Conversely, the reg- 
ularity of purchases across shops is more important in 
explaining the predictability of the poor. These results 
are supported by the differential in variance of (time s a 
store visited) between the rich and the rest (fig. |5(d) ). 

Wealthy individuals are more likely to group purchases 
in time as well as in space. We propose the bundling of 
shopping trips as a behavioral explanation for this vari- 
ance. We can think of bundling as a trip comprising 
visits to several stores, germane to the bursty behavior 
seen in other human systems [5 . Field studies [6] have 
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FIG. 5: (a) Distribution of number of stores frequented by sub-samples of wealthy and poor individuals; (b) random and 
temporally-uncorrelated entropies of sub-samples: the rich have higher overall entropies; (c) yet the by-person difference 
between random and uncorrelated entropies is greater for wealthy individuals, suggesting that variety in consumption patterns, 
rather than allotment over time, preferentially contributes to entropy for this group; (d) the greater variance in visits per store 
supports the notion that the entropy of the rich is due to preference for variety; (e) wealthy consumers are more likely to bundle 
together shopping events 



found that when choosing a grocery store, wealthy indi- 
viduals optimize for proximity to other merchants (and 
thus make multiple purchases in the same trip), while 
poor individuals chose stores close to their homes. 

We define bundling as variance over bins representing 
daily activity, and observe a difference in distribution 
of bu ndling level between rich and poor individuals (fig. 

El- 

Holding income constant, we also find differences in 
customers with high and low entropies: while the most 
predictable individuals (entropy in highest quintile) tend 
to have grocery stores and restaurants as their topped- 
rank merchant, their unpredictable counterparts (en- 
tropy in last quintile) frequent fast food chains and gas 
stations. In simulation, there is a 31.4% chance that two 
of the most or least predictable people will be found at 
the same merchant type for any given shopping event, 
but a predictable person will only overlap with an unpre- 
dictable person 11.2% of the time. 

Colloquially, an unpredictable person can exhibit one 
of several patterns: he may be hard to pin down, reli- 
ably late, or merely spontaneous. Yet given an individ- 
ual's record, we seem to be able able to agree on whether 



he is predictable or not. As a more formal measure for 
human behavior, however, information- theoretic entropy 
conflates several of these notions. A person who discov- 
ers new shops and impulsively swipes his card presents a 
different case than the one who routinely distributes his 
purchases between his five favorite coffee shops, yet both 
have high entropy. Entropy remains a useful metric for 
comparisons between individuals and datasets (such as 
in the present study), but further work is need to tease 
out the correlates of predictability using measures aligned 
with observed behavior. 

In examining the solitary footprints that together com- 
prise the invisible hand, we find that shopping, like mobil- 
ity, is a highly predictable behavior at longer time scales. 
However, there exists substantial unpredictability in the 
sequence of shopping events over short time scales. While 
our results use an independent dataset to validate those 
of Song et al, they also highlight the uniqueness of shop- 
ping behavior. A cell phone tower is a waypoint in an 
individual's daily trajectory, but a store is a destination, 
and ultimately, a nexus for human social and economic 
activity. 
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